DNA nanotweezers studied with a coarse-grained model of DNA 
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We introduce a coarse-grained rigid nucleotide model of DNA that reproduces the basic thermo- 
dynamics of short strands: duplex hybridization, single-stranded stacking and hairpin formation, 
and also captures the essential structural properties of DNA: the helical pitch, persistence length and 
torsional stiffness of double-stranded molecules, as well as the comparative flexibility of unstacked 
single strands. We apply the model to calculate the detailed free-energy landscape of one full cycle 
of DNA 'tweezers', a simple machine driven by hybridization and strand displacement. 

PACS numbers: 87.14.gk,87.15.A-,81.07.Nb,34.20.Gj 



The field of DNA nanotechnology has grown rapidly in 
recent years as investigators have harnessed the selectiv- 
ity of DNA base pairing to form many different kinds of 
structures. Recent examples include: large ribbons PQ, 
two dimensional lattices [2] and polyhedra [3l H] , made by 
hybridizing systems of short strands (oligonucleotides). 
Another technique, DNA origami [5], uses short 'staple' 
strands to fold a long polynucleotide into almost any two- 
dimensional shape, and has recently been extended to 
three-dimensional structures [6l [7] . 

The free energy of hybridization can also be harnessed 
in artificial molecular machines. The simplest designs, 
such as DNA 'tweezers' |5], use alternating addition of 
two complementary strands to drive a system through 
a conformational cycle. More sophisticated autonomous 
machines catalyze the hybridization of strands initially 
present in inert forms, such as hairpins. Hybridization 
cycles can be coupled to a DNA track, creating DNA 
walkers that undergo directional motion [9j [10] . 

Computer simulations of these DNA nanosystems 
would provide highly desirable insight into the details 
of the processes involved in assembly or mechanical cy- 
cles. Unfortunately, the system sizes and time scales 
involved make all-atom simulations prohibitively expen- 
sive. Instead, models that coarse-grain out the micro- 
scopic details must be used. In the remainder of this 
letter we introduce a coarse-grained model designed to 
reproduce generic DNA behaviour in both single- and 
double-stranded states, as well as the fundamental as- 
sembly transitions. We then demonstrate the use of the 
model for DNA nanostructures by simulating a full cycle 
for DNA tweezers. 

A number of other coarse-grained models of DNA have 
been suggested in recent years. Non- helical models, with 
two interaction sites per nucleotide, have been applied 
to duplex [11], hairpin [12] [13] and four-arm junction 
formation [IT], as well as the gelation of oligonucleotide 
functionalized colloids [14]. Helical models with two [15] 
or three [I6j [17] sites per nucleotide have been used to 
study denaturation and hybridization of double-stranded 
DNA. However, to study the formation of nanostructures 
or the operation of hybridization- driven nanodevices, it 




FIG. 1: (colour online) A duplex as represented by the model, 
and a detailed view of a nucleotide. 



is essential to have a physically reasonable representation 
of both single and double-stranded states. Earlier mod- 
els either neglect the helicity of double-stranded DNA, or 
impose it through restrictions on the backbone of a sin- 
gle strand, which leads to an unphysical representation of 
single-stranded DNA. Furthermore, the thermodynamic 
properties of hybridization (particularly the widths of 
transitions) have not been well reproduced. 

We take a 'top-down' approach to DNA modeling, aim- 
ing to capture the generic properties of DNA that are 
important for assembly rather than reproducing all of 
the microscopic structural details. DNA is modelled as 
a string of rigid nucleotides, as depicted in Fig. [I] with 
one interaction site to represent the backbone and three 
for the base. The plane of the base is indicated by an 
additional 'normal vector'. The backbone sites are con- 
nected via FENE springs, and act as soft repulsion cen- 
tres (along with the base repulsion sites) to reproduce 
steric interactions. 

The helicity of our model results directly from the 
stacking interactions between base stacking interaction 
sites. Consecutive bases attract each other with a min- 
imum at approximately 3.4 A, shorter than the equilib- 
rium FENE spring length of approximately 6.5 A. We 
modulate this interaction according to the relative align- 
ment of the normal vectors, and the alignment of the 
normal vectors with the inter-site vector. Thus the sys- 
tem is driven towards forming helical stacks of coplanar 
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bases: right handedness is imposed by setting the attrac- 
tion to zero if the bases stack left-handedly. 

Hydrogen bonding is represented by an attraction be- 
tween hydrogen bonding sites of complementary bases, 
modulated by factors favouring co-linear nucleotides with 
antiparallel normals. With the stacking interaction, hy- 
drogen bonding drives the formation of right-handed dou- 
ble helices with the approximate geometry of B-DNA. We 
also include a cross-stacking interaction between bases 
that are diagonally opposite each other in a double he- 
lix, enabling the tendency of 'dangling ends' to stabilize 
duplexes to be reproduced. The complete form of all 
potentials can be found in the Ref. [T8] . 

For simplicity, several features of DNA have been ne- 
glected. Firstly, although only complementary bases can 
bond in our model, all bases are otherwise identical; at 
this stage we are interested in the generic properties of 
DNA assembly rather than specific base heterogeneity 
effects. Secondly, we fit the parameters using experimen- 
tal data at just one salt concentration, [Na] = 500 mM, 
where the Debye screening length is short (~ 4.5 A) and 
most properties are only weakly salt dependent. Finally, 
major and minor grooving are neglected. 

We simulate the model using the 'virtual move Monte 
Carlo' algorithm of Whitelam and Geissler [19]. Due 
to the system's simplicity and the efficiency of the algo- 
rithm, denaturation and hybridization of short duplexes 
can be observed without biasing the ensemble. To gain 
accurate statistics, however, we use umbrella sampling 
techniques [20] to characterize the basic DNA transitions. 

The simplest of these is single-stranded stacking, in 
which ssDNA undergoes a transition from an ordered, 
helical form at low temperature to a disordered struc- 
ture at high temperature [21 . Our model reproduces 
a broad, almost uncooperative transition with an en- 
thalpy of /\H stack = — 5.6kcalmol _1 and entropy of 
AS stack = -le.ekcalmor 1 K _1 , consistent with the ex- 
periments of Holbrook et al. [22] . 

We study duplex formation by simulating two comple- 
mentary strands in a box at an effective concentration 
of 0.317 mM, extrapolating to bulk using the method in 
Ref. [23 . We compare to melting temperatures (T m ) ob- 
tained from the nearest neighbor model of SantaLucia 
[24] , which is able to accurately predict experiments, for 
strands consisting of 'average bases' (defined by averag- 
ing over the parameters for all possible complementary 
base pair steps). Fig. [2] shows that our model is in excel- 
lent agreement with the predictions for T m over a range 
of duplex lengths. Importantly, transition widths are 
also consistent to within approximately 2 K, and thus the 
agreement in T m will hold over a range of concentrations. 

The third basic transition is hairpin formation, in 
which self-complementary strands bind to themselves to 
form a stem and hairpin loop. Our model underestimates 
T m relative to the nearest-neighbour model by approxi- 
mately 3K (less than 1% of the absolute temperature), 
but importantly captures the dependence on loop (Fig. 
[5| and stem length (not shown). 
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FIG. 2: (colour online) (a) Comparison of melting temper- 
atures as computed for our model (crosses connected by a 
solid line) and predicted by the nearest-neighbour model [24] 
(squares connected by a dashed line) for duplexes as a func- 
tion of the single-stranded length, and hairpins as a function 
of loop length for a stem of six bases, (b) Melting profile for 
an eight base duplex as predicted by our model (solid line) 
and the nearest- neighbour model (dashed line). 



In addition to thermodynamics, the model reproduces 
many of the physical properties of DNA essential for nan- 
otechnology. Model duplexes have a pitch of 10.4 base 
pairs per turn, a persistence length of 160 base pairs and 
an RMSD of 3.7° in the twist of each base pair rise. Un- 
stacked single strands are comparatively flexible, having 
a persistence length of 18.2 A (we define model length 
scales so that the average rise per base pair at 300 K is 
3.3 A). These values compare favourably with reported 
experimental results of 10.5 base pairs per turn [25], 135- 
150 base pairs [26], 3.9° [26] and 19.4 A[27 , respectively. 

Having demonstrated that our model reproduces the 
essential physics of DNA assembly, we apply it to 'DNA 
tweezers', a simple exampl of DNA hybridization driving 
conformationalchanges [8 . The cycle is shown in Fig. [3j 
with the tweezer unit switching between open and closed 
conformations as fuel (f) and antifuel (f) strands are se- 
quentially added, producing an ff duplex as waste. 

For simplicity we simulate a system approximately half 
the size of that originally used by Yurke et al [8 , with 
the sequences listed in Ref. [18 . The tweezers themselves 
consist of three strands (a hinge strand and two arms (a 
and (3)), forming two duplex regions of ten base pairs con- 
nected by a flexible, single-stranded hinge of four bases. 
At the end of the duplexes, there are overhanging single- 
stranded sections of eight bases. The fuel f is 24 bases in 
length, and is complementary to the overhanging regions 
of the tweezers, enabling it to bind to both and close the 
tweezers (Fig. [3^). The additional eight bases provide a 
'toehold' for binding of the antifuel f, which is also 24 
bases long and complementary to the whole of f. 

The tweezers, like many DNA based machines, rely 
on toehold- mediated strand displacement [28]. After the 
addition of f, the closed structure becomes metastable as 
the free energetic minimum of the system is an ff duplex 
isolated from the tweezers, f can bind to the toehold of 
f (Fig. |3|d)): f and a then compete for binding to the 
rest of f. By binding to available bases, f reduces the free 



FIG. 3: (colour online) Simulation snapshots showing stages of operation of DNA tweezers, a) Tweezers initially open, b) Fuel 
(f) is added and binds to one arm (/?). c) Fuel binds to the second arm (a) and closes the tweezers, d) Antifuel (?) is added 
and binds to the toehold of the fuel, e) Antifuel begins to displace first arm of the tweezers, f) Tweezers open as first arm is 
displaced, and antifuel starts to displace the second arm. g) Antifuel fully hybridizes to fuel and the waste duplex is formed. 



energy barrier for dissociation of f from a, thereby accel- 
erating the approach to equilibrium. Once a is displaced, 
the process is repeated with /?. 

We have sampled the free energy landscape of the sys- 
tem consisting of one set of tweezers and a single f and 
f, in a periodic cell of volume 4.19 x 10 5 nm 3 (Fig. [4]). 
Every stage of the cycle is observable using unbiased sim- 
ulations at 300 K. To obtain the free energy landscape, 
however, we split the order parameter space into umbrella 
sampling windows, which were then combined using the 
weighted histogram analysis method [29] . Further details 
on how the sampling was performed are given in Ref. [18] 

To study the cycle in detail, it is convenient to consider 
a one-dimensional pathway through the landscape; we 
use that shown by the arrows in Fig. [4] The free energy 
difference between 'a' and 'g' is 47. 12 ±0.21 kT along this 
path; simulations of f and f in isolation (displayed along 
y = in Fig. |]) give 47.27 ± 0.11 feT, the agreement 
supporting the accuracy of our calculations. 

The gross features of the free energy landscape are as 
expected. Duplex formation is highly cooperative; the 
pairing of two strands involves a high entropic cost for 
forming the first base pair, then a downhill slope in free 
energy as additional bonds are formed [17]. This is re- 
flected in Fig.[5]by stages 'b', £ c' and 'd' which essentially 
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FIG. 4: (colour online) Free energy F plotted as a function 
the number of f/f and f/tweezer base pairs for DNA tweezers 
at 300 K. White areas indicate high free-energy regions that 
were unsampled. 
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FIG. 5: (colour online) (a) Free energy profile along the 
one-dimensional pathway indicated in Fig. [4] Coordinates 
indicate the number of f/f and f/tweezer base pairs, (b) The 
displacement process 'e' in more detail. Squares represent the 
original system, circles a system with the tail of f unable to 
form a hairpin and crosses a system with the last eight bases 
of f and most of the {/ f3 arm removed (see text). 



involve duplex formation. The large cooper at ivity sug- 
gests that f will fully bind to one arm of the tweezers be- 
fore binding to the second. The displacement processes 
(indicated by V and 'f ' in Fig. [4| are comparatively flat 
as the total number of interstrand base pairs is constant. 
Returning the tweezers to the open state (between V and 
T) and the decoupling of the ff duplex from the tweez- 
ers ('g') release the free energy stored in bringing strands 
together, resulting in large decreases in free energy. 

Computer simulations allow for a detailed inspection 
of processes like displacement. Thus, Fig. [5] shows that 
there is actually an increase in free energy of ~3 kT dur- 
ing the displacement of the first strand a, even though 
the total number of interstrand base pairs in the sys- 
tem stays constant. The increase in free energy with dis- 
placement is initially steady, with a sharper jump after 
four bases, followed by another smooth increase. Con- 
versely, the displacement of the second strand (3 shows 
a steady decrease in free energy as more bases are dis- 
placed. These slopes suggest a significant difference in 
speed for the two processes: our unconstrained simula- 
tions show that the first displacement requires about 10 
times as many Monte Carlo moves, suggesting a slow 
displacement of the first arm, followed by a quicker dis- 
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placement of the second. 

Two effects help to explain the increase in free energy 
during the displacement of a. Firstly, f is capable of 
forming a hairpin structure, as shown in Fig.[3jd), which 
is marginally stable at 300 K. After the displacement of 
four bases of a, however, the hairpin can no longer form, 
leading to the observed step up in free energy. Simula- 
tions were performed in which the final eight bases of f 
were prevented from forming hairpins (Fig. |5jb)). These 
show no equivalent effect, confirming this explanation. 

Unless displacing strands are deliberately designed 
otherwise, it is likely that small hairpins will form, with 
the probability of accidental hairpins increasing with the 
length of the strand. The nearest-neighbour model of 
SantaLucia [24] predicts that hairpins with stems of three 
base pairs and short loops are marginally stable at 300 K, 
supporting the suggestion of our simulations that they 
can influence free energy profiles. Furthermore, these 
hairpins will form either at the start or end of displace- 
ment, when long single- stranded regions are available. 
As a consequence, hairpin formation will generally con- 
stitute a free energy barrier in the middle stages of dis- 
placement, thereby slowing down the process. 

The second reason for the increase of free energy comes 
from steric effects. On binding to the toehold of f, the un- 
bound end of f has its conformational freedom restricted 
by the presence of the rest of the tweezers. As displace- 
ment begins, a second single-stranded region is formed, 
causing further steric restrictions. As more bases are 
displaced, the single-stranded regions are drawn into the 
body of the tweezers, causing additional steric restriction 
as illustrated in Fig.|3|e). Computer simulations of a re- 



duced system in which the final eight bases of f (which are 
not involved in displacing a) and all but the first base pair 
of the f//3 duplex were removed (details in Ref. [18]) show 
a significantly flatter landscape after the initial penalty 
for forming two single-stranded regions, confirming this 
explanation (Fig.[5jb)). By contrast, the displacement of 
f3 by f reduces the amount of steric clashes as the tweezer 
unit is further separated from the f and f strands with 
each step, leading to a decrease in free energy during the 
displacement. 

Many of the features of the free-energy landscape — 
the sharp initial rise upon forming the first base pairs, 
or even the more subtle effects of hairpin formation and 
excluded volume on the displacement steps — are suffi- 
ciently generic that they would survive even if much more 
chemical detail was included in the simulations. Future 
model development will include the addition of base het- 
erogeneity effects and the explicit effects of salt concen- 
tration, but even at the current level we believe that our 
model will be particularly useful to study the design and 
operation of DNA nanomachines. Furthermore, we an- 
ticipate many potential applications for biologically rele- 
vant rearrangement transitions, such as the formation of 
cruciform DNA [25] . 

In summary, we have introduced a new coarse-grained 
model of DNA which reproduces its thermodynamic and 
structural properties, representing single-stranded stack- 
ing, duplex and hairpin transitions consistently for the 
first time. The model makes possible the simulation of 
DNA nanostructure assembly and nanomachine opera- 
tion, and has the potential to be extended into the bio- 
logical domain. 
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